International Education Studies; Vol. 7, No. 11; 2014 
ISSN 1913-9020 E-ISSN 1913-9039 
Published by Canadian Center of Science and Education 


Psychometric Properties on Lecturers’ Beliefs on Teaching Function: 

Rasch Model Analysis 

Samah Ali Mohsen Mofreh 1 ' 2 , Mohammed Najib Abdul Ghafar 1 , Abdul Hafiz Hj Omar 2 , Monsurat Mosaku 1 & 

Amar Ma’mf 1 

1 Faculty of Education, Universiti Tecknologi Malaysia, Malaysia 

2 Sport Innovation & Technology Center, FBME, Universiti Tecknologi Malaysia, Malaysia 

Correspondence: Samah Ali Mohsen Mofreh, Sport Innovation & Technology Center, FBME, Universiti 
Tecknologi Malaysia, Skudai 81310, Malaysia. Tel: 601-3771-0562. E-mail: samahm09@gmail.com 

Received: May 5, 2014 Accepted: August 27, 2014 Online Published: October 28, 2014 
doi:10.5539/ies.v7nllp47 URL: http://dx.doi.org/10.5539/ies.v7nllp47 


Abstract 

This paper focuses on the psychometric analysis of lecturers’ beliefs on teaching function (LBTF) survey using 
Rasch Model analysis. The sample comprised 34 Community Colleges’ lecturers. The Rasch Model is applied to 
produce specific measurements on the lecturers’ beliefs on teaching function in order to generalize results and 
inferential studies. The items proved to measure a single dimension of lecturers’ beliefs and how it influences the 
teaching function. The developed instrument termed LBTF covers six sub dimensions. Both construct and 
content validity were achieved through the Rasch Model analysis using the dimensionality, item fit, and item 
polarity parameters. The reliability of the instrument was achieved by conducting person and item separation 
analysis, Cronbach alpha, and calculated person and item reliability estimates. The results of Rasch Model 
analysis show that the items of LBTF fit the model appropriately. 

Keywords: psychometric analysis, Rasch model, beliefs, teaching functions, community colleges, validity, 
reliability 

1. Introduction 

Lecturers’ beliefs are developed throughout their lives and are influenced by a variety of factors, including 
events, experiences, and other people in their lives (Knowles, 1992). Where some beliefs are taken directly from 
the culture, others are culturally framed. These experiences shape their beliefs about students, curriculum 
development and the overall process of schooling (McGillicuddy-De Lisi & Subramanian, 1996; Aikenhead, 
2005). Shulman (1986) concluded that lecturers’ beliefs come from four sources: the accumulated 
knowledge-content, materials and educational structures, lecturer’s formal education, and “practical wisdom”, 
that is from practical experience. Furthermore, the sense of thinking and reasoning (developmental) processes 
stands out more than the acquisition of specific knowledge (OECD, 2009). Beliefs influence people’s knowledge 
acquisition and interpretation, task selection and organization, and ways of understanding (Mansor, 2010). 
Lecturers construct their own meaning of any curriculum as they (delve into it and initiate how to implement it in 
the classroom) negotiate an orientation towards it and decide what, if anything, to implement in their classroom 
(Aikenhead, 2005). Thus, lecturer participation in the curriculum planning process is considered essential, 
whether in the defining of problems or the presenting of concrete solutions in the form of programs of study. The 
failure of much curriculum innovation has been attributed to the neglect by innovators of teachers’ perceptions 
(Sutherland, 1981), and it seems that teachers’ own interests and concerns are only rarely allowed to influence or 
direct the choices made by curriculum developers (Ben-Peretz, 1980). 

Many studies have investigated the effects of teacher beliefs on instructional practices (Ball, 1996; Beswick, 
2005, 2007; Handal & Herrington, 2003; D. McLeod & S. McLeod, 2002), little research exists which precisely 
explore what happens when there is a conflict between these beliefs and current practices. For example, although, 
several studies show that beliefs have a strong impact on action, a research conducted in Turkey by Karaagap and 
Threlfall (2004) shows that lecturer’ goals, in particular when it “imposes” lecturers can bring to class 
practices that conflict with their beliefs). Lecturers need significant help in identifying the difference between 
their beliefs and practices espoused to think through to new inherent initiative culture (Standen, 2002). Lecturers 
update their beliefs in accordance to new forms of instruction, but unfortunately they are not changing their 
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current teaching methods (Quinn & Wilson, 1997). Mansour (2010) supported the idea that teachers are essential 
change agents for educational development and that teachers’ beliefs are precursors to change. Consequently, it 
is essential to take lectures’ beliefs and practices into account, as well as the factors that form or influence those 
beliefs and practices (Mansor, 2010). 

Developing the researcher’s own instrument requires knowledge about item or question construction, scale 
development, format, and length, validity and reliability of the instrument and its scores (Sekaran, 2003; 
Creswell, 2012; Johnson & Christensen, 2012). Measuring the lecturers’ beliefs needs a measurable instrument 
to be developed and tested in order to clarify how beliefs reform lecturers’ teaching functions. Thus, this study 
aims to use the Rasch Model analysis as a powerful tool for evaluating constructs validity and reliability of the 
instmment in order to use it for interpretation the lecturers’ beliefs on teaching functions. Furthermore, the 
research questions in this paper are Do the items of lecturers’ beliefs on teaching functions scale have adequate 
fit statistics, showing that each item relates to the variable and measurement tool in a meaningful way, and does 
the lecturers’ belief on teaching functions scale demonstrate high separation and good reliability in person and 
items set? 

1.1 Objective 

Based on the perspective of using Rasch Model (RM) as a model in one sense in that it represents the structure 
which data should exhibit in order to obtain measurements from the data; i.e. it provides a criterion for 
successful measurement. As such, measuring the lecturers’ beliefs needs a measurable instmment to be 
developed and tested in order to clarify how beliefs reform lecturers’ teaching functions. Thus, this study aims to 
use the. RM analysis as a powerful tool for evaluating constructs validity and reliability of the instmment in 
order to use it for interpretation the lecturers’ beliefs on teaching functions. 

2. Theoretical Framework 

2.1 Beliefs and Teaching Functions 

Learning is seen as the active construction constructing knowledge in the gradual expansion of networks of ideas 
through interaction with other people and materials in the environment (Marshall, 1992). Constructivism puts 
primary emphasis on the independence of the interpretation of each of their own experience and ways to build 
their lives and ideas (Newbrough, 1995; Roth, 1994; Gil-Perez et al., 2002; Mansour, 2009). However, before 
expecting lecturers to change their beliefs, they need to first be aware of them and then beliefs may be remained 
unconsciously (Crandall, 2000). Many researchers pointed out that the lecturers must rethink their role of 
teaching in order to facilitate communication situations appropriate to the nature of the various 
inter-relationships (between lecturer and student, between students and between lecturers, students and content) 
(Goodyear, Spector, Steeples, & Tickner, 2001; Coppola, Hiltz, & Rotter, 2002; Williams, 2003; Akdere & 
Marshall, 2005). Variety of inteipretations of the terms “lecturer’s functions” and “competencies” (Gonczi et al., 
1993, Eraut, 1998; Salmon, 2000, Goodyear et al., 2001; Westera, 2001; Anderson et al., 2003). Therefore, the 
concept of competency is used in various ways. Two approaches are clear defined the concept of competency; 
one viewing competency defines it as an observation skill sets as a personal skill or ability, related to the 
effectiveness of behavior, and another approach that includes skills such as strategic behavior, related to the 
ability to adjust performance requirements of the context (Eraut, 1998). The changes perceived by lecturers as 
required for teaching in virtual environments situation (Coppola et al., 2002). 

2.2 Basic Principles of Rasch Measurement Model 

Developing valid measures of abstract constructs is essential to advancement of psychological and educational 
researches. The most common practice in scale development consists of a administering a group of items 
intended to measure the same construct and subsequently aggregating the responses to form total a scale value. 
The items are weighted equally in the summation and treated as if all fall on an interval scale (Kindlon et al., 
1996). Weighting items equally implies that all items are of identical importance in assessing the construct. In 
addition, treating items as linear (equal interval) assumes that psychological distance between scale points (such 
as: strongly agree, agree, neutral, disagree, and strongly disagree) is the same throughout the item (Kindlon et al., 
1996). Moreover, reliability of scores, number of underlying constructs, and scale construction practices should 
empirically test the assumptions of equal-item weighting and the linear treatment of scales can represent the 
quantity of a trait possessed by an individual. Rasch Model (RM) is a model in one sense in that it represents the 
structure which data should exhibit in order to obtain measurements from the data; i.e. it provides a criterion for 
successful measurement. The RM, one of a group of models originating from item response theory, was initially 
developed in connection with the construction of ability tests. The model expresses Guttman’s basic ideas in a 
probabilistic manner, as follows: (a) given any item, a person of higher ability should have a higher probability 
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of getting the item right than would a person of lower ability, and (b) given any person, an item of lower 
difficulty should be solved (gotten right) with a higher probability. In the RM, the probability of a specified 
response (e.g. right/wrong answer) is modeled as a function of person and item parameters (Bond & Fox, 2007). 
Specifically, in the original RM, the probability of a correct response is modeled as a logistic function of the 
difference between the person and item parameter.RM analysis is a powerful tool for evaluating construct 
validity. Rasch fit statistics are indications of construct irrelevant variance and gaps on Rasch item-person map 
are indications of construct under-representation. There are some important aspects of RM measurement should 
be considered to understand the interpretations of its results analysis. 

2.2.1 Item Polarity 

Analysis of the polarity or consistency of the items are indicators used to show the items move in one direction 
to which the constructs being measured. A positive indicator show all items are moving in parallel function to 
measure the constructs formed. If there is a negative indicator for a particular item it should re-examine the data 
to be improved or removed either because of these indicators show that there are items or individuals who 
respond in difference with the construct (Linacre, 2003). Item polarity or point measure correlation (PTMEA 
Corr.) is the early detection of construct validity (Bond & Fox, 2007). 

2.2.2 Dimensionality 

Dimensionality aspects are important for determining the instrument was measured in one direction and one 
dimension (Linacre, 2003; Bond & Fox, 2007). Dimensionality aspect is one of the conditions in analysis using 
the RM. This is to ensure content validity and construct validity o f the instrument (Wu & Adams, 2007). 
Dimensionality refers to the forcing on one attribute or dimension at a time. The criteria for dimensionality is 
exceeding of 40 % (Linacre, 2003; Bond & Fox, 2007). 

2.2.3 Rating Scale Analysis 

One of the significant aspects of RM is determining the probability of participant responses equally spread 
between the scales. RM has ability to differentiate among scales of instrument based on data gathered. RM 
polychromous data analysis is used to determine whether it correspondents to the model or not. Polychromous 
RM can also measure the hypothesis of a scale in terms of adding value to the agreement or disagreement as 
moving from one continuum to another continuum (Linacre, 2003; Bond & Fox, 2007). Not all scales can be 
used for RM. If the structure calibration is less than 1.40 and more than 5 this scale should be collapsed (Linacre, 
2003; Bond & Fox, 2007). 

2.2.4 Item separation 

Item separation refers to all participants are able to answer all level difficulty of items. That means the 
participants can be separated based on those constructs that are measured. The criterion for usefulness of an 
instillment is exceeding its item separation (Linacre, 2007). A higher value of separation means greater spared of 
items and persons along a continuum. Lower values of separation indicate redundancy in items and less 
variability of persons on the trait. 

2.2.5 Item and Person Reliability 

Item and person reliability item reliability refers to the consistency of item placement along the pathway if these 
items were given to another sample of the same size that behaved the same way. While person reliability refers to 
the consistency of person ordering that could be accepted if this sample of persons were given a parallel set of 
items measuring the same construct (Wright & Masters, 1982). The criteria for accepting reliability RM is 
exceeding 0.50 (Linacre, 2007; Bond & Fox, 2007). 

2.2.6 Infit and Misfit 

Infit refers to the degree of fit of an item or a person. Infit means square is transformation of the residuals, the 
difference between the predicated and observed for easy interoperation. It expects value is 1. As a rule of thumb, 
values between 0.70 and 1.30 are generally regards as acceptable. Values greater than 1.30 are termed misfiting 
and those less than 0.70 as overfiting (Bond & Fox, 2007). Another values suggested by Linacare (2005) is 0.5 < 
x < 1.5. 

3. Method 

This study being a pilot study aims to test the validity and reliability of instrument in order to measure the 
lecturers’ beliefs on teaching function. Questionnaire was used as a survey design. The sample of this pilot study 
was 34 respondents who were lecturers at CC. The questionnaire includes 68 questions divided into 9 constructs 
which are classroom management includes 3 questions, curriculum knowledge includes 4 questions lesson plan 
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& presentation includes 5 questions, teaching strategies includes 15 questions, communication includes 4 
questions, assessing students’ learning includes 11 questions, prior knowledge includes 2 questions, and 
professionalism includes 20 questions respectively. The RM analysis investigated the validity and reliability of 
the LBTF instrument. The questionnaire was developed based on the standards criteria of RM analysis which are 
item dimensionality, item polarity, item fit analysis. Calibration scale instrument was undertaken during the 
implementation of pilot tests to assess the suitability of the scale of the LBTF instrument. Progressive scale of 
five categories was used for these instruments that consist of 1 = never, 2 = rarely, 3 = frequently, 4 = very 
frequently, and 5 = always. Some researchers agree that the optimal length of scales needs to determine by the 
nature of what is to be examined and extent to which respondents can discriminate among levels (Light et al., 
1990). 

4. Results 

Quantitative data of developed LBTF instrument was analyzed by using Winsteps version 3.68.2 in order to test 
the questionnaire items’ validity and reliability. Summary of RM analysis results of the developed LBTF 
questionnaire as followed. 

4.1 Dimensionality Analysis 

Dimensionality aspects are important for determining the instrument was measured in one direction and one 
dimension (Linacre, 2003; Bond & Fox, 2007). In Rasch analysis, a satisfactory dimensionality determined by 
raw variance explained by measures which should be more than 40% and unexplned variance in 1st contrast 
which should be < 15. Table 2 shows raw variance explained by measures was 53.0%, and unexplned variance in 
1st contrast was 6.1 %. Thus, dimensionality data results posts that the LBTF data fit the RM as illustrated in 
Table 1. 

Table 1. Item dimensionality of LBTF 




Empirical 


Modeled 

Total raw variance in observations 

162.4 


100. 0% 

100.0% 

Raw variance explained by measures 

86.4 


53.2% 

52.9% 

Raw variance explained by persons 

28.1 


17.3% 

17.25% 

Raw Variance explained by items 

58.4 


35.9% 

35.7% 

Raw unexplained variance (total) 

76.0 

46.8% 

100.0% 

47.1% 

Unexplned variance in 1st contrast 

10.0 

6.1% 

13.1% 


Unexplned variance in 2nd contrast 

8.3 

5.1% 

10.9% 


Unexplned variance in 3rd contrast 

7.7% 

4.7% 

10.1% 


Unexplned variance in 4th contrast 

6.8 

4.2% 

9.0% 


Unexplned variance in 5th contrast 

5.5 

3.4% 

7.3% 



4.2 Reliability Analysis 

Pilot test was conducted with 76 items for LBTF instrument, and with 36 items for LTPM instrument among 34 
lecturers of CC in Yemen. The criteria for accepting reliability in RM is exceeding 0.50 (Linacre, 2007; Bond & 
Fox, 2007). In addition, acceptable separation should be more than 2 (Fisher, 2007). Reliability reports on the 
consistency of a respondent’s answers to the items in the scale (Pedhazur & Schmelkin, 1991). The RM analysis 
measures reliability with person separation reliability. This statistic shows the ability of the items to separate 
persons with different levels of the concept measured. Rasch reliability of the items was compared with 
Cronbach’s alpha (CA). CA is a measure of internal consistency, and estimates the reliability of the scale by 
computing the variance between all possible pairs of items. Tables 2 and 3 show that the person and item 
reliability, person and item separation for LBTF instrument. As shown in Table 2 the person reliability was very 
high with .96, and the person separation was 4.68. Table 3 shows the item reliability was 0.93 and item 
separation was 3.59 which are acceptable. Therefore, the results of person and item reliability and person and 
item separation for LBTF indicate satisfactory readability for the LBTF instrument. Analysis of the study 
showed the reliability of 34 respondents with 67 items in theses constructs was high to measure the LBTF at 
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community colleges in Yemen. Thus, the reliability of item and person for LBTF instalment values are fairly 
close together and bother presenting a strong acceptable level. 


Table 2. Person separation and reliability 



Score 

Count 

Measure 

Error 

Infit 

Outfit 


IMSQ 

ZSTD 

OMSQ 

ZSTS 

Mean 

3.3.9 

76.0 

1.51 

.21 

1.05 

.0 

1.02 

-.1 

S.D 

36.6.6 

.0 

1.28 

.17 

.37 

1.8 

.76 

1.4 

Real RMSE 

.27 








ADJ. SD 

1.25 








Separation 

4.68 








Person liability 

.96 








em separation and reliability 


Score 

Count 

Measure 

Error 

Infit 

Outfit 


IMSQ 

ZSTD 

OMSQ 

ZSTS 

Mean 

136.9 

34.0 

.00 

.27 

1.02 

.0 

1.02 

.0 

S.D 

19.6 

.0 

1. 04 

.06 

.38 

1.3 

.76 

1.0 

Real rmse 

.28 








Adj. sd 

1.04 








Separation 

3.59 








Item reliability 

.93 









4.3 Item Polarity and Item Fit Analysis 

Item polarity analysis is the essential step for measuring the validity of the constructs. Item polarity or point 
measure correlation (PTMEA Corr.) is the early detection of construct validity (Bond & Fox, 2007). This 
analysis has the same function as factor analysis to access the relationship between the items in measuring the 
constructs that required. The good correlation values of items should be > .20. As shown in table 4 there is no 
negative value and all PTMEA of each item is > .20. Thus, the correlations of items indicate that there are no mis 
takes in data entry or miscoded items and this fit statistics with in the acceptable limit. 

In the analysis of appropriateness and inappropriateness items of LBTF, RM analyzed each construct separately. 
The Item measure can list the logit measurement information for each item. Appropriateness of items under 
schedule also shows the information for mean square (MNSQ) to make it easier for outlier detection or misfit. 
Table 4 shows the schedule of LBTF items of the constructs of classroom management (CM), Curriculum 
knowledge (CK), Lesson Plan Presentation (LPP), Teaching Strategies (TS), Communication and Relation with 
Students (CRS), Assessing Students’ Learning (ASL), Enhancing Professional Performance (EPP), and 
Integration (I). For the analysis of these constructs items, result expected MNSSQ infit analysis value should be 
0.4 <x <1.5, and PTMEA value should be+ 0.2 <x<l (Fox & Bond, 2007). It can be seen in the table 4 that all 
items fit. Therefore, the data are deemed acceptable for this study. 


Table 4. Item fit analysis 


Entry 

Total 

Count 

Measure 

Model 

Infit 

Outfit 

Pt-Measure 

Exact 

Match 

Items 

No 

Score 



S.E 

MNSQ 

ZSTD 

MNSQ 

ZSTD 

CORR. 

EXP. 

' OBS% 

EXP% 


i 

142 

34 

-0.13 

0.24 

1.02 

0.2 

6.92 

5.6 

A .26 

.45 

57.6 

51.7 

CM.l 

20 

127 

34 

0.63 

0.21 

1.54 

1.9 

1.25 

0.7 

K .47 

.53 

42.4 

46.0 

TS.20 
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63 

163 

34 

-2.03 

0.41 

1.44 

1.2 

1.11 

0.4 

L .24 

.26 

87.9 

81.0 

EPP.63 

60 

128 

34 

0.58 

0.21 

1.37 

1.4 

1.38 

1.0 

M .48 

.52 

21.2 

46.8 

EPP.60 

62 

153 

34 

-0.88 

0.29 

1.38 

1.3 

1.02 

0.2 

N .34 

.37 

45.5 

59.7 

EPP.62 

30 

156 

34 

-1.15 

0.31 

1.36 

1.2 

1.02 

0.2 

0 .33 

.34 

66.7 

64.0 

CRS.30 

8 

154 

34 

-0.97 

0.29 

1.35 

1.2 

1.20 

0.5 

P .32 

.36 

42.4 

61.0 

LPP.8 

37 

140 

34 

-0.02 

0.24 

1.33 

1.2 

1.04 

0.3 

Q .46 

.46 

48.5 

50.5 

ASL.37 

65 

164 

34 

-2.21 

0.44 

0.92 

0.0 

1.30 

0.7 

R .17 

.24 

84.8 

83.4 

EPP.65 

19 

127 

34 

0.63 

0.21 

1.29 

1.1 

1.25 

0.7 

S .42 

.53 

45.5 

46.0 

TS.19 

44 

134 

34 

0.30 

0.22 

1.28 

1.1 

1.25 

0.7 

T .49 

.50 

21.2 

47.4 

ASL.44 

27 

151 

34 

-0.73 

0.28 

1.26 

1.0 

0.96 

0.1 

U .42 

.39 

60.6 

58.8 

TS.27 

24 

113 

34 

1.19 

0.19 

1.21 

0.9 

1.25 

0.8 

V .54 

.58 

45.5 

44.5 

TS.24 

6 

150 

34 

-0.65 

0.27 

1.22 

0.8 

1.04 

0.3 

W .43 

.40 

63.6 

58.5 

CK.6 

52 

129 

34 

0.54 

0.21 

1.16 

0.7 

1.21 

0.6 

X .45 

.52 

33.3 

46.9 

EPP.52 

61 

129 

34 

0.54 

0.21 

1.19 

0.8 

1.15 

0.5 

Y .51 

.52 

30.3 

46.9 

EPP.61 

18 

129 

34 

0.54 

0.21 

1.18 

0.7 

1.17 

0.5 

Z .43 

.52 

48.3 

46.9 

TS.18 

21 

128 

34 

0.58 

0.21 

0.83 

-0.6 

0.75 

-0.6 

z .57 

.52 

45.5 

46.8 

TS.21 

17 

141 

34 

-0.07 

0.24 

0.82 

-0.6 

0.79 

-0.3 

y .46 

.46 

60.6 

51.0 

TS.17 

56 

139 

34 

-0.04 

0.23 

0.79 

-0.7 

0.74 

-0.4 

x .50 

.47 

60.6 

49.3 

EPP.56 

35 

143 

34 

0.19 

0.24 

0.77 

-0.8 

0.79 

-0.3 

w .46 

.45 

66.7 

52.2 

ASL.35 

53 

131 

34 

0.44 

0.22 

0.77 

-0.8 

0.74 

-0.6 

v .55 

.51 

51.5 

47.0 

EPP.53 

28 

162 

34 

-1.87 

0.39 

0.75 

-0.6 

0.54 

-0.8 

u .38 

27 

84.8 

77.6 

TS.28 

45 

151 

34 

-0.73 

0.28 

0.72 

-1.0 

0.64 

-0.5 

t .46 

.39 

69.7 

58.8 

ASL.45 

41 

130 

34 

0.49 

0.21 

0.71 

-1.1 

0.64 

-0.9 

s .60 

.52 

57.6 

47.1 

ASL.41 

11 

142 

34 

-0.13 

0.24 

0.70 

-1.2 

0.69 

-0.5 

r .50 

.45 

51.5 

51.7 

LPP.ll 

40 

131 

34 

0.44 

0.22 

0.69 

-1.2 

0.66 

-0.8 

q .61 

.51 

39.4 

47.0 

ASL.40 

49 

132 

34 

0.40 

0.22 

0.68 

-1.3 

0.63 

-0.9 

p .59 

.51 

54.5 

47.1 

EPP.49 

2 

139 

34 

0.04 

0.23 

0.67 

-1.3 

0.67 

-0.7 

o .55 

47 

42.4 

49.3 

CM.2 

55 

147 

34 

-0.44 

0.26 

0.65 

-1.4 

0.64 

-0.5 

n .50 

.42 

63.6 

56.1 

EPP.55 

67 

117 

34 

1.04 

0.20 

0.64 

-1.6 

0.61 

-1.2 

m .65 

57 

33.3 

44.9 

LEPP.67 

3 

137 

34 

0.15 

0.23 

0.63 

-1.5 

0.53 

-1.1 

1 .54 

.48 

75.8 

48.7 

CM.3 

50 

140 

34 

-0.02 

0.24 

0.63 

-1.5 

0.60 

-0.8 

k .54 

.46 

51.5 

50.5 

EPP.50 

23 

133 

34 

0.35 

0.22 

0.63 

-1.5 

0.62 

-0.9 

j -60 

.50 

54.5 

47.4 

TS.23 

32 

133 

34 

0.35 

0.22 

0.62 

-1.6 

0.59 

-1.0 

i .59 

.50 

48.5 

47.4 

CRS.32 

14 

118 

34 

1.00 

0.20 

0.62 

-1.8 

0.58 

-1.4 

h .63 

.57 

54.5 

45.0 

TS.14 

39 

128 

34 

0.58 

0.21 

0.61 

-1.7 

0.59 

-1.1 

g -63 

.52 

48.5 

46.8 

ASL.39 

47 

144 

34 

-0.25 

0.25 

0.57 

-1.8 

0.59 

-0.8 

f .54 

.44 

57.6 

52.3 

PKE.47 

16 

141 

34 

-0.07 

0.24 

0.57 

-1.8 

0.57 

-0.9 

e .52 

.46 

57.6 

51.0 

TS.16 

31 

144 

34 

-0.25 

0.25 

0.54 

-1.9 

0.55 

-0.9 

d .58 

.44 

63.6 

52.3 

CRS.31 

51 

138 

34 

0.09 

0.23 

0.47 

-2.4 

0.45 

-1.4 

c .59 

.48 

75.8 

49.2 

EPP.51 

43 

149 

34 

-0.58 

0.27 

0.45 

-2.4 

0.44 

-1.0 

b .55 

.41 

75.8 

58.1 

ASL.43 

13 

127 

34 

0.63 

0.21 

0.44 

-2.7 

0.42 

-1.8 

a .65 

.53 

72.7 

46.0 

LPP.13 
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4.4 Calibration Scaling Analysis 

RM analysis can also help to determine the validity of the scale used to make zero calibration setting and 
subsequent grading scale used. Rasch analysis determines the validity of the response probabilities being spread 
fairly across scales. Table 6 illustrates the schedules for grading scale calibration analysis of the LBTF survey 
method. 

Table 5 and figure 1 show a summary of the category structure on a scale gradation and size structure of the 
intersection. In the column arrangement observation (observed count) shows the respondents’ answers given to 
ranking scale. As shown in the Table 4, the most frequent answer is the scale of respondents rank ing 5 which 
983 (39 %). The next grading scale that respondents selected was scale 4 of 870 (35%). The scale 3 had 430 
(17%) respondents. While the least grading scale of least were scale 2 with 118 (5%) respondents, and scale of 1 
of 107 (4%) respondents. 

The observed averages show the pattern of respondents. Fairly normal pattern is expected with systematic 
instrument from negative to positive. As illustrated in Table 7, the response pattern obtained started from -1.65 
logit and moved up monotonously towards +2.62 logit signifying that the pattern of respondents’ answers are 
fairly normal. 


Table 5. Calibration scaling analysis 


Category 

Score 

Label 

Observed 

Observed 

Sample 

Infit 

Outfit 

Structure 

Category 


Count % 

Avg. 

Expect 

MNSQ 

MNSQ 

Calibration 

Measure 


1 1 

107 







1 

4 

118 

-1.65 

-1.56 

0.87 

0.85 

None 

(-2.48) 



2 2 

-0.09 

-0.17 

0.94 

0.84 

-0.91 

-1.18 

2 


5 









430 








3 3 

17 

0.66 

0.62 

1.08 

1.04 

-1.04 

-0.19 

3 


870 








4 4 

35 

1.27 

1.31 

1.05 

1.17 

0.25 

1.10 

4 


983 








5 5 

39 

2.62 

2.61 

1.05 

1.01 

1.70 

(2.95) 
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Figure 1. Summary of the category structure on a scale gradation 
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5. Conclusion 

This paper uses RM analysis to evaluate the usefulness of several items used in the LBTF as a measurement 
scale lecturers’ teaching functions. The items are identified according to theory and evaluated according to the 
Rasch Measurement Model using Winsteps software. This study as a psychometric study tested the validity and 
reliability of the LBTF in order to develop the LBTF instrument. Based on the results of the Rasch analysis 
measurement; item reliability was 0.87 > 0.50, item separation was 2.58 > 2.0, dimensionality, and evaluation of 
RM fit (infit <1.5) resulted in LBTF with good psychometric properties. 
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